Physical Mapping by STS Hybridization :
نویسندگان
چکیده
An important tool in the analysis of genomic sequences is the physical map. In this paper we examine the construction of physical maps from hybridization data between STS (sequence tag sites) probes and clones of genomic fragments. An algorithmic theory of the mapping process, a proposed performance evaluation procedure, and several new algorithmic strategies for mapping are given. A unifying theme for these developments is the idea of a \conservative extension." An algorithm, measure of algorithm quality, or description of physical map is a conservative extension if it is a generalization for data with errors of a corresponding concept in the error-free case. In our algorithmic theory we show that the nature of hybridization experiments imposes inherent limitations on the mapping information recorded in the experimental data. We prove that only certain types of mapping information can be reliably calculated by any algorithm. A test generator is then presented along with quantitative measures for determining how much of the possible information is being computed by a given algorithm. Weaknesses and strengths of these measures are discussed. Each of the new algorithms presented in this paper is based on combinatorial optimizations. Despite the fact that all the optimizations are NP-complete, we have developed algorithmic tools for the design of competitive approximation algorithms. We apply our performance evaluation program to our algorithms and obtain solid evidence that the algorithms are capable of retrieving high-level reliable mapping information.
منابع مشابه
An STS content map of human chromosome 11: localization of 910 YAC clones and 109 islands.
Physical mapping of human chromosomes at a resolution of 100 kb to 1 Mb will provide important reagents for gene identification and framework templates for ultimately determining the complete DNA sequence. Sequence-tagged site (STS) content mapping, coupled with large fragment cloning in yeast artificial chromosomes, provides an efficient mechanism for producing first-generation, low-resolution...
متن کاملIntegrated mapping package--a physical mapping software tool kit.
We have developed an integrated physical mapping computer software package (IMP), originally designed to support the physical mapping of human chromosome 13 and expanded to support several gene-identification projects based on the positional candidate approach. IMP displays map data in a form that provides useful guidelines to the end users. An integrated map with high resolution and confidence...
متن کاملComparative mapping of human chromosome 3 genes in the pig shows different gene order
and Implications A comparative map of human chromosome 3 (HSA3) and pig chromosome 13 (SSC13) was constructed using physically assigned pig sequence tagged sites (STSs). Pig STS representing 11 HSA3 genes were developed and 10 pig STS were regionally mapped using a somatic cell hybrid panel (SCHP) to SSC13 with 80Ð100% concordance. Large-insert probes were obtained by screening a YAC library wi...
متن کاملEuchromatic Genome
A PCR-based sequence-tagged site (STS) content mapping strategy has been used to generate a physical map with 90% coverage of the 120-Mb euchromatic portion of the Drosophila genome. To facilitate map completion, the bulk of the STS markers was chosen in a nonrandom fashion. To ensure that all contigs were localized in relation to each other and the genome, these contig-building procedures were...
متن کاملImperfectness of Data for STS-Based Physical Mapping
In the STS-based mapping, we are requested to obtain the correct order of probes in a DNA sequence from a given set of fragments or equivalently a hybridization matrix A. It is well-known that the problem is formulated as the combinatorial problem of obtaining a permutation of A’s columns so that the resulting matrix has the consecutive-one property. If the data (the hybridization matrix) is er...
متن کامل